NLify: Third-Party Programming Support for Spoken Natural Language Interfaces
Abstract
This paper presents the design and implementation of a programming system that enables third-party developers to add spoken natural language (SNL) interfaces to mobile applications. Existing systems either restrict SNL capabilities to first-party applications or limit developer-defined spoken interactions to keyphrases rather than broad natural language. An examination of expert workflow reveals two primary challenges: gathering comprehensive sets of paraphrases for each command, and selecting and tuning the corresponding statistical models for speech and language processing. We address the former by integrating automated statistical machine paraphrasing and web-scale crowdsourcing into the developer workflow, and the latter by developing a classifier architecture designed to be robust across app domains. We have fully realized our design as an extension to the Visual Studio IDE. Based on a new benchmark dataset of 3500 spoken instances of 27 commands from 20 subjects, together with a small developer study, we establish the promise of our approach and the impact of various design choices.
Similar resources
SplineAPI: A REST API for NLP Services
Modern applications often use Natural Language Processing (NLP) techniques and algorithms to provide sets of rich features. Researchers, who come up with these algorithms, often implement them for case studies, evaluation or as proof of concepts. These implementations are, in most cases, freely available for download and use. Nevertheless, these implementations do not comprise final software pa...
Towards Portable Controlled Natural Languages for Querying Ontologies
Natural Language Interfaces (NLIs) to structured data allow users to interact with a system using written or spoken language to perform tasks that require knowledge of a formal language. Due to natural language complexity and ambiguity, such interfaces usually support a Controlled Natural Language (CNL): a subset of a natural language that includes certain vocabulary and grammar rules that have...
Perceiving the GUISE: Graphical User Interface Specification Extraction
We present a dynamic control-flow analysis and state classifier for graphical user interfaces. Search engines, end-user programming interfaces, and automated testers exploit such information, but are challenged by client-side and server-side scripts obscuring it: our analysis succeeds on popular web applications that contain both. We further motivate such analyses. First, we introduce a new type ...
Spoken Dialogue Interfaces: Integrating Usability
Usability is a fundamental requirement for natural language interfaces. Usability evaluation reflects the impact of the interface and the acceptance from the users. This work examines the potential of usability evaluation in terms of issues and methodologies for spoken dialogue interfaces along with the appropriate designer-needs analysis. It unfolds the perspective to the usability integration...
SpeechActs: A Framework for Building Speech Applications
SpeechActs is a prototype framework for building and integrating multiple speech applications. The eventual goal of the SpeechActs research project is to allow users to telephone their Sun™ workstations and interact with a wide range of Solaris® applications. Instead of using touch tones, our aim is to allow users to accomplish most or all of the interaction using continuous speec...